Intelligent System for Speaker Identification using Lip features with PCA and ICA
Abstract
Biometric authentication techniques are more consistent and efficient than conventional authentication techniques and can be used in monitoring, transaction authentication, information retrieval, access control, forensics, etc. In this paper, we present a detailed comparative analysis of Principal Component Analysis (PCA) and Independent Component Analysis (ICA) for feature extraction, evaluated with different Artificial Neural Networks (ANNs): Back Propagation (BP), Radial Basis Function (RBF), and Learning Vector Quantization (LVQ). We use the TULIPS1 database (Movellan, 1995), a small audiovisual database of 12 subjects saying the first 4 digits in English, to evaluate these methods. Six geometric lip features that carry identity-relevant information are considered: the height and width of the outer corners of the mouth, the height and width of the inner corners of the mouth, the height of the upper lip, and the height of the lower lip. After comprehensive analysis and evaluation, a maximum speaker recognition accuracy of 91.07% is achieved using PCA with RBF, and 87.36% using ICA with RBF. Speaker identification has a wide scope of applications such as access control, monitoring, transaction authentication, information retrieval, forensics, etc.
Keywords: Biometric authentication; Intelligent System; Lip Features; Independent Component Analysis (ICA); Principal Component Analysis (PCA); Back Propagation (BP); Radial Basis Function (RBF); Learning Vector Quantization (LVQ).
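The PCA feature-extraction step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The data below are synthetic stand-ins for six-dimensional geometric lip feature vectors (the TULIPS1 measurements are not reproduced here), and the function names `pca_fit` and `pca_transform`, the number of frames per speaker, and the choice of three retained components are all assumptions made for illustration.

```python
import numpy as np

def pca_fit(X, n_components):
    """Fit PCA on row-wise samples X via SVD of the centered data.

    Returns the feature mean, the top principal directions (as rows),
    and the variance explained by each retained direction.
    """
    mean = X.mean(axis=0)
    Xc = X - mean
    # Rows of Vt are principal directions, ordered by decreasing variance.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = (s ** 2) / (X.shape[0] - 1)
    return mean, Vt[:n_components], explained[:n_components]

def pca_transform(X, mean, components):
    """Project samples onto the retained principal directions."""
    return (X - mean) @ components.T

# Synthetic stand-in data (assumption): 12 speakers, 20 frames each,
# 6 geometric lip features per frame.
rng = np.random.default_rng(0)
n_speakers, frames_per_speaker, n_features = 12, 20, 6
centers = rng.normal(size=(n_speakers, n_features))
X = np.vstack([
    c + 0.1 * rng.normal(size=(frames_per_speaker, n_features))
    for c in centers
])
labels = np.repeat(np.arange(n_speakers), frames_per_speaker)

mean, components, explained = pca_fit(X, n_components=3)
Z = pca_transform(X, mean, components)  # reduced features for a classifier
```

In a pipeline like the one the abstract outlines, the reduced vectors `Z` and the speaker `labels` would then be passed to a classifier such as an RBF network; that stage is omitted here.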
Similar resources
PSO Based Optimized Reliability for Robust Multimodal Speaker Identification
Speaker recognition in real environment with reliable mode is a key challenge for ubiquitous service in human computer interface. In this paper, we present a robust multimodal speaker identification system with optimized reliability of different modalities. We propose an extension of modified convection function’s optimizing factors to account optimum reliability simultaneously in audio, face a...
Speaker recognition using MPEG-7 descriptors
Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers....
Unsupervised Extraction of Multi-Frame Features for Lip-Reading
The features of human lip motion from video clips are extracted by three unsupervised learning algorithms, i.e., Principal Component Analysis (PCA), Independent Component Analysis (ICA), and Non-negative Matrix Factorization (NMF). Since the human perception of facial motion goes through two different pathways, i.e., the lateral fusiform gyrus for the invariant aspects and the superior temporal ...
Discrimination Analysis of Lip Motion Features for Multimodal Speaker Identification and Speech-reading
In this thesis a new multimodal speaker/speech recognition system that integrates audio, lip texture, lip geometry, and lip motion modalities is presented. There have been several studies that jointly use audio, lip intensity and/or lip geometry information for speaker identification and speech-reading applications. This work proposes using explicit lip motion information, instead of or in addi...
Performance Analysis of Robust Method to Identify the Speaker Using Lip Segmentation
This document addresses the problem of providing security to vehicles based on a unique biometric feature that is lip motions. This work proposes the use of explicit lip motion features for speaker identification so that the car can be unlocked depending on the identification process results. For identification process, lip boundaries are tracked over the images and compared to the database. Fo...
Journal: CoRR
Volume: abs/1004.4478
Publication date: 2010